SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Hilton, Douglas J. 
(ii) TITLE OF INVENTION: A NOVEL HAEMOPOIETIN RECEPTOR 
(iii) NUMBER OF SEQUENCES: 25 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SCULLY, SCOTT, MURPHY & PRESSER 

(B) STREET: 400 Garden City Plaza 

(C) CITY: Garden City 

(D) STATE: New York 

(E) COUNTRY: United States of America 

(F) ZIP: 11530 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/702,665 

(B) FILING DATE: 20-DEC-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Presser, Leopold 

(B) REGISTRATION NUMBER: 19,827 

(C) REFERENCE/DOCKET NUMBER: 10296 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (516) 742-4343 

(B) TELEFAX: (516) 742-4366 

(C) TELEX: 203 901 SANS UR 

(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 3..3 

(D) OTHER INFORMATION: /note= "Xaa at Position 3 is any amino acid" 







(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

Trp Ser Xaa Trp Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1705 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 45..1340 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GAGAGGGTGA GGGCGGAGGC CGCTGGCGGC GGCTGCCGCA GAAG ATG AGC AGC AGC 56 

Met Ser Ser Ser 
1 

TGC TCA GGG CTG ACC AGG GTC CTG GTG GCC GTG GCT ACA GCC CTG GTG 104 
Cys Ser Gly Leu Thr Arg Val Leu Val Ala Val Ala Thr Ala Leu Val 
5 10 15 20 

TCT TCC TCC TCC CCC TGC CCC CAA GCT TGG GGT CCT CCA GGG GTC CAG 152 
Ser Ser Ser Ser Pro Cys Pro Gln Ala Trp Gly Pro Pro Gly Val Gln 
25 30 35 

TAT GGA CAA CCT GGC AGG CCC GTG ATG CTG TGC TGC CCC GGA GTG AGT 200 
Tyr Gly Gln Pro Gly Arg Pro Val Met Leu Cys Cys Pro Gly Val Ser 
40 45 50 

GCT GGG ACT CCA GTG TCC TGG TTT CGG GAT GGA GAT TCA AGG CTG CTC 248 
Ala Gly Thr Pro Val Ser Trp Phe Arg Asp Gly Asp Ser Arg Leu Leu 
55 60 65 

CAG GGA CCT GAC TCT GGG TTA GGA CAC AGA CTG GTC TTG GCC CAG GTG 296 
Gln Gly Pro Asp Ser Gly Leu Gly His Arg Leu Val Leu Ala Gln Val 
70 75 80 

GAC AGC CCT GAT GAA GGC ACT TAT GTC TGC CAG ACC CTG GAT GGT GTA 344 
Asp Ser Pro Asp Glu Gly Thr Tyr Val Cys Gln Thr Leu Asp Gly Val 
85 90 95 100 

TCA GGG GGC ATG GTG ACC CTG AAG CTG GGC TTT CCC CCA GCA CGT CCT 392 
Ser Gly Gly Met Val Thr Leu Lys Leu Gly Phe Pro Pro Ala Arg Pro 
105 110 115 

GAA GTC TCC TGC CAA GCG GTA GAC TAT GAA AAC TTC TCC TGT ACT TGG 440 
Glu Val Ser Cys Gln Ala Val Asp Tyr Glu Asn Phe Ser Cys Thr Trp 
120 125 130 

AGT CCA GGC CAG GTC AGC GGT TTG CCC ACC CGC TAC CTT ACT TCC TAC 488 
Ser Pro Gly Gln Val Ser Gly Leu Pro Thr Arg Tyr Leu Thr Ser Tyr 
135 140 145 

AGG AAG AAG ACG CTG CCA GGA GCT GAG AGT CAG AGG GAA AGT CCA TCC 536 
Arg Lys Lys Thr Leu Pro Gly Ala Glu Ser Gln Arg Glu Ser Pro Ser 
150 155 160 

ACC GGG CCT TGG CCG TGT CCA CAG GAC CCT CTG GAG GCC TCC CGA TGT 584 
Thr Gly Pro Trp Pro Cys Pro Gln Asp Pro Leu Glu Ala Ser Arg Cys 
165 170 175 180 

GTG GTC CAT GGG GCA GAG TTC TGG AGT GAG TAC CGG ATC AAT GTG ACC 632 
Val Val His Gly Ala Glu Phe Trp Ser Glu Tyr Arg Ile Asn Val Thr 
185 190 195 

GAG GTG AAC CCA CTG GGT GCC AGC ACG TGC CTA CTG GAT GTG AGA TTA 680 
Glu Val Asn Pro Leu Gly Ala Ser Thr Cys Leu Leu Asp Val Arg Leu 
200 205 210 

CAG AGC ATC TTG CGT CCT GAT CCA CCC CAA GGA CTG CGG GTG GAA TCC 728 
Gln Ser Ile Leu Arg Pro Asp Pro Pro Gln Gly Leu Arg Val Glu Ser 
215 220 225 

GTA CCT GGT TAC CCG AGA CGC CTG CAT GCC AGC TGG ACA TAC CCT GCC 776 
Val Pro Gly Tyr Pro Arg Arg Leu His Ala Ser Trp Thr Tyr Pro Ala 
230 235 240 

TCC TGG CGT CGC CAA CCC CAC TTT CTG CTC AAG TTC CGG TTG CAA TAC 824 
Ser Trp Arg Arg Gln Pro His Phe Leu Leu Lys Phe Arg Leu Gln Tyr 
245 250 255 260 

CGA CCA GCA CAG CAT CCA GCC TGG TCC ACG GTG GAG CCC ATT GGC TTG 872 
Arg Pro Ala Gln His Pro Ala Trp Ser Thr Val Glu Pro Ile Gly Leu 
265 270 275 

GAG GAA GTG ATA ACA GAT GCT GTG GCT GGG CTG CCA CAC GCG GTA CGA 920 
Glu Glu Val Ile Thr Asp Ala Val Ala Gly Leu Pro His Ala Val Arg 
280 285 290 

GTC AGT GCC AGG GAC TTT CTG GAT GCT GGC ACC TGG AGC GCC TGG AGC 968 
Val Ser Ala Arg Asp Phe Leu Asp Ala Gly Thr Trp Ser Ala Trp Ser 
295 300 305 

CCA GAG GCC TGG GGT ACT CCT AGC ACT GGT CCC CTG CAG GAT GAG ATA 1016 
Pro Glu Ala Trp Gly Thr Pro Ser Thr Gly Pro Leu Gln Asp Glu Ile 
310 315 320 

CCT GAT TGG AGC CAG GGA CAT GGA CAG CAG CTA GAG GCA GTA GTA GCT 1064 
Pro Asp Trp Ser Gln Gly His Gly Gln Gln Leu Glu Ala Val Val Ala 
325 330 335 340 

CAG GAG GAC AGC CCG GCT CCT GCA AGG CCT TCC TTG CAG CCG GAC CCA 1112 
Gln Glu Asp Ser Pro Ala Pro Ala Arg Pro Ser Leu Gln Pro Asp Pro 
345 350 355 

AGG CCA CTT GAT CAC AGG GAC CCC TTG GAG CAA GTA GCT GTG TTA GCG 1160 
Arg Pro Leu Asp His Arg Asp Pro Leu Glu Gln Val Ala Val Leu Ala 
360 365 370 

TCT CTG GGA ATC TTC TCT TGC CTT GGC CTG GCT GTT GGA GCT CTG GCA 1208 
Ser Leu Gly Ile Phe Ser Cys Leu Gly Leu Ala Val Gly Ala Leu Ala 
375 380 385 

CTG GGG CTC TGG CTG AGG CTG AGA CGG AGT GGG AAG GAT GGA CCG CAA 1256 
Leu Gly Leu Trp Leu Arg Leu Arg Arg Ser Gly Lys Asp Gly Pro Gln 
390 395 400 

AAA CCT GGG CTC TTG GCA CCC ATG ATC CCG GTG GAA AAG CTT CCA GGA 1304 
Lys Pro Gly Leu Leu Ala Pro Met Ile Pro Val Glu Lys Leu Pro Gly 
405 410 415 420 

ATT CCA AAC CTG CAG AGG ACC CCA GAG AAC TTC AGC TGATTTCATC 1350 
Ile Pro Asn Leu Gln Arg Thr Pro Glu Asn Phe Ser 
425 430 

TGTAACCCGG TCAGACTGGG GGCAGAAAGA GGCGGGGCAG TGGATCCCTG TGGATGGAGG 1410 

TCTCAGCTGA AAGTCTGAGC TCTTTTCTTT GACACCTATA CTCCAAACTT GCTGCCGGCT 1470 

GAAGGCTGTC TGGACTTCCG ATGTCCTGAG GTGGAAGTCC ACCTGAGGAA TGTGTACAGA 1530 

AGTCTGTGTT CCTGTGATCG TGTGTGTATG TGAGACAGGG AGCAAAAGTT CTCTGCATGT 1590 

GTGTACAGAT GATTGGAGAG TGTGTGCGGT CTTGGGCTTG GCCCTTCTGG GAAGTGTGAA 1650 

GAGTTGAAAT AAAAGAGACG GAAGTTTTTG GAAAAAAAAA AAAAAAAAAA AAAAA 1705 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Ser Ser Ser Cys Ser Gly Leu Thr Arg Val Leu Val Ala Val Ala 
1 5 10 15 

Thr Ala Leu Val Ser Ser Ser Ser Pro Cys Pro Gln Ala Trp Gly Pro 
20 25 30 

Pro Gly Val Gln Tyr Gly Gln Pro Gly Arg Pro Val Met Leu Cys Cys 
35 40 45 

Pro Gly Val Ser Ala Gly Thr Pro Val Ser Trp Phe Arg Asp Gly Asp 
50 55 60 

Ser Arg Leu Leu Gln Gly Pro Asp Ser Gly Leu Gly His Arg Leu Val 
65 70 75 80 

Leu Ala Gln Val Asp Ser Pro Asp Glu Gly Thr Tyr Val Cys Gln Thr 
85 90 95 

Leu Asp Gly Val Ser Gly Gly Met Val Thr Leu Lys Leu Gly Phe Pro 
100 105 110 

Pro Ala Arg Pro Glu Val Ser Cys Gln Ala Val Asp Tyr Glu Asn Phe 
115 120 125 

Ser Cys Thr Trp Ser Pro Gly Gln Val Ser Gly Leu Pro Thr Arg Tyr 
130 135 140 

Leu Thr Ser Tyr Arg Lys Lys Thr Leu Pro Gly Ala Glu Ser Gln Arg 
145 150 155 160 

Glu Ser Pro Ser Thr Gly Pro Trp Pro Cys Pro Gln Asp Pro Leu Glu 
165 170 175 

Ala Ser Arg Cys Val Val His Gly Ala Glu Phe Trp Ser Glu Tyr Arg 
180 185 190 

Ile Asn Val Thr Glu Val Asn Pro Leu Gly Ala Ser Thr Cys Leu Leu 
195 200 205 

Asp Val Arg Leu Gln Ser Ile Leu Arg Pro Asp Pro Pro Gln Gly Leu 
210 215 220 

Arg Val Glu Ser Val Pro Gly Tyr Pro Arg Arg Leu His Ala Ser Trp 
225 230 235 240 

Thr Tyr Pro Ala Ser Trp Arg Arg Gln Pro His Phe Leu Leu Lys Phe 
245 250 255 

Arg Leu Gln Tyr Arg Pro Ala Gln His Pro Ala Trp Ser Thr Val Glu 
260 265 270 

Pro Ile Gly Leu Glu Glu Val Ile Thr Asp Ala Val Ala Gly Leu Pro 
275 280 285 

His Ala Val Arg Val Ser Ala Arg Asp Phe Leu Asp Ala Gly Thr Trp 
290 295 300 

Ser Ala Trp Ser Pro Glu Ala Trp Gly Thr Pro Ser Thr Gly Pro Leu 
305 310 315 320 

Gln Asp Glu Ile Pro Asp Trp Ser Gln Gly His Gly Gln Gln Leu Glu 
325 330 335 

Ala Val Val Ala Gln Glu Asp Ser Pro Ala Pro Ala Arg Pro Ser Leu 
340 345 350 

Gln Pro Asp Pro Arg Pro Leu Asp His Arg Asp Pro Leu Glu Gln Val 
355 360 365 

Ala Val Leu Ala Ser Leu Gly Ile Phe Ser Cys Leu Gly Leu Ala Val 
370 375 380 

Gly Ala Leu Ala Leu Gly Leu Trp Leu Arg Leu Arg Arg Ser Gly Lys 
385 390 395 400 

Asp Gly Pro Gln Lys Pro Gly Leu Leu Ala Pro Met Ile Pro Val Glu 
405 410 415 

Lys Leu Pro Gly Ile Pro Asn Leu Gln Arg Thr Pro Glu Asn Phe Ser 
420 425 430 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 128..1396 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

TCTAACAGCC TTACCCCACT TGGTGCATCA ATTTTTCTCC TAGGAAGCCT CAGTTTTGGA 60 

GAGGAAGAGC CAGGCTTTAG CTCCCATCTC AGGGGTCGGG GATTTTTGAC TCTACCTCTC 120 

CCCACAG ATG AGC AGC AGC TGC TCA GGG CTG AGC AGG GTC CTG GTG GCC 169 
Met Ser Ser Ser Cys Ser Gly Leu Ser Arg Val Leu Val Ala 
1 5 10 

GTG GCT ACA GCC CTG GTG TCT GCC TCC TCC CCC TGC CCC CAG GCC TGG 217 
Val Ala Thr Ala Leu Val Ser Ala Ser Ser Pro Cys Pro Gln Ala Trp 
15 20 25 30 

GGC CCC CCA GGG GTC CAG TAT GGG CAG CCA GGC AGG TCC GTG AAG CTG 265 
Gly Pro Pro Gly Val Gln Tyr Gly Gln Pro Gly Arg Ser Val Lys Leu 
35 40 45 

TGT TGT CCT GGA GTG ACT GCC GGG GAC CCA GTG TCC TGG TTT CGG GAT 313 
Cys Cys Pro Gly Val Thr Ala Gly Asp Pro Val Ser Trp Phe Arg Asp 
50 55 60 

GGG GAG CCA AAG CTG CTC CAG GGA CCT GAC TCT GGG CTA GGG CAT GAA 361 
Gly Glu Pro Lys Leu Leu Gln Gly Pro Asp Ser Gly Leu Gly His Glu 
65 70 75 

CTG GTC CTG GCC CAG GCA GAC AGC ACT GAT GAG GGC ACC TAC ATC TGC 409 
Leu Val Leu Ala Gln Ala Asp Ser Thr Asp Glu Gly Thr Tyr Ile Cys 
80 85 90 

CAG ACC CTG GAT GGT GCA CTT GGG GGC ACA GTG ACC CTG CAG CTG GGC 457 
Gln Thr Leu Asp Gly Ala Leu Gly Gly Thr Val Thr Leu Gln Leu Gly 
95 100 105 110 

TAC CCT CCA GCC CGC CCT GTT GTC TCC TGC CAA GCA GCC GAC TAT GAG 505 
Tyr Pro Pro Ala Arg Pro Val Val Ser Cys Gln Ala Ala Asp Tyr Glu 
115 120 125 

AAC TTC TCT TGC ACT TGG AGT CCC AGC CAG ATC AGC GGT TTA CCC ACC 553 
Asn Phe Ser Cys Thr Trp Ser Pro Ser Gln Ile Ser Gly Leu Pro Thr 
130 135 140 

CGC TAC CTC ACC TCC TAC AGG AAG AAG ACA GTC CTA GGA GCT GAT AGC 601 
Arg Tyr Leu Thr Ser Tyr Arg Lys Lys Thr Val Leu Gly Ala Asp Ser 
145 150 155 

CAG AGG AGG AGT CCA TCC ACA GGG CCC TGG CCA TGC CCA CAG GAT CCC 649 
Gln Arg Arg Ser Pro Ser Thr Gly Pro Trp Pro Cys Pro Gln Asp Pro 
160 165 170 

CTA GGG GCT GCC CGC TGT GTT GTC CAC GGG GCT GAG TTC TGG AGC CAG 697 
Leu Gly Ala Ala Arg Cys Val Val His Gly Ala Glu Phe Trp Ser Gln 
175 180 185 190 

TAC CGG ATT AAT GTG ACT GAG GTG AAC CCA CTG GGT GGT GCC AGC ACA 745 
Tyr Arg Ile Asn Val Thr Glu Val Asn Pro Leu Gly Gly Ala Ser Thr 
195 200 205 

CGC CTG CTG GAT GTG AGC TTG CAG AGC ATC TTG CGC CCT GAC CCA CCC 793 
Arg Leu Leu Asp Val Ser Leu Gln Ser Ile Leu Arg Pro Asp Pro Pro 
210 215 220 

CAG GGC CTG CGG GTA GAG TCA GTA CCA GGT TAC CCC CGA GGC CTG CGA 841 
Gln Gly Leu Arg Val Glu Ser Val Pro Gly Tyr Pro Arg Arg Leu Arg 
225 230 235 

GCC AGC TGG ACA TAC CCT GCC TCC TGG CCG TGC CAG CCC CAC TTC CTG 889 
Ala Ser Trp Thr Tyr Pro Ala Ser Trp Pro Cys Gln Pro His Phe Leu 
240 245 250 

CTC AAG TTC CGT TTG CAG TAC CGT CCG GCG CAG CAT CCA GCC TGG TCC 937 
Leu Lys Phe Arg Leu Gln Tyr Arg Pro Ala Gln His Pro Ala Trp Ser 
255 260 265 270 

ACG GTG GAG CCA GCT GGA CTG GAG GAG GTG ATC ACA GAT GCT GTG GCT 985 
Thr Val Glu Pro Ala Gly Leu Glu Glu Val Ile Thr Asp Ala Val Ala 
275 280 285 

GGG CTG CCC CAT GCT GTA CGA GTC AGT GCC CGG GAC TTT CTA GAT GCT 1033 
Gly Leu Pro His Ala Val Arg Val Ser Ala Arg Asp Phe Leu Asp Ala 
290 295 300 

GGC ACC TGG AGC ACC TGG AGC CCG GAG GCC TGG GGA ACT CCG AGC ACT 1081 
Gly Thr Trp Ser Thr Trp Ser Pro Glu Ala Trp Gly Thr Pro Ser Thr 
305 310 315 

GGG ACC ATA CCA AAG GAG ATA CCA GCA TGG GGC CAG CTA CAC ACG CAG 1129 
Gly Thr Ile Pro Lys Glu Ile Pro Ala Trp Gly Gln Leu His Thr Gln 
320 325 330 

CCA GAG GTG GAG CCT CAG GTG GAC AGC CCT GCT CCT CCA AGG CCC TCC 1177 
Pro Glu Val Glu Pro Gln Val Asp Ser Pro Ala Pro Pro Arg Pro Ser 
335 340 345 350 

CTC CAA CCA CAC CCT CGG CTA CTT GAT CAC AGG GAC TCT GTG GAG CAG 1225 
Leu Gln Pro His Pro Arg Leu Leu Asp His Arg Asp Ser Val Glu Gln 
355 360 365 

GTA GCT GTG CTG GCG TCT TTG GGA ATC CTT TCT TTC CTG GGA CTG GTG 1273 
Val Ala Val Leu Ala Ser Leu Gly Ile Leu Ser Phe Leu Gly Leu Val 
370 375 380 

GCT GGG GCC CTG GCA CTG GGG CTC TGG CTG AGG CTG AGA CGG GGT GGG 1321 
Ala Gly Ala Leu Ala Leu Gly Leu Trp Leu Arg Leu Arg Arg Gly Gly 
385 390 395 

AAG GAT GGA TCC CCA AAG CCT GGG TTC TTG GCC TCA GTG ATT CCA GTG 1369 
Lys Asp Gly Ser Pro Lys Pro Gly Phe Leu Ala Ser Val Ile Pro Val 
400 405 410 

GAC AGG CGT CCA GGA GCT CCA AAC CTG TAGAGGACCC AGGAGGGCTT 1416 
Asp Arg Arg Pro Gly Ala Pro Asn Leu 
415 420 

CGGCAGATTC CACCTATAAT CCTGTCTTGC TGGTGTGGAT AGAAACCAGG CAGGACAGTA 1476 

GATCCCTATG GTTGGATCTC AGCTGGAAGT TCTGTTTGGA GCCCATTTCT GTGAGACCCT 1536 

GTATTTCAAA TTTGCAGCTG AAAGGTGCTT GTACCTCTGA TTTCACCCCA GAGTTGGAGT 1596 

TCTGCTCAAG GAACGTGTGT AATGTGTACA TCTGTGTCCA TGTGTGACCA TGTGTCTGTG 1656 

AAGCAGGGAA CATGTATTCT CTGCATGCAT GTATGTAGGT GCCTGGGGAG TGTGTGTGGG 1716 

TCCTTGGCTC TTGGCCTTTC CCCTTGCAGG GGTTGTGCAG GTGTGAATAA AGAGAATAAG 1776 

GAAGTTCTTG GAGATTATAC TCAG 1800 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ser Ser Ser Cys Ser Gly Leu Ser Arg Val Leu Val Ala Val Ala 
1 5 10 15 

Thr Ala Leu Val Ser Ala Ser Ser Pro Cys Pro Gln Ala Trp Gly Pro 
20 25 30 

Pro Gly Val Gln Tyr Gly Gln Pro Gly Arg Ser Val Lys Leu Cys Cys 
35 40 45 

Pro Gly Val Thr Ala Gly Asp Pro Val Ser Trp Phe Arg Asp Gly Glu 
50 55 60 

Pro Lys Leu Leu Gln Gly Pro Asp Ser Gly Leu Gly His Glu Leu Val 
65 70 75 80 

Leu Ala Gln Ala Asp Ser Thr Asp Glu Gly Thr Tyr Ile Cys Gln Thr 
85 90 95 

Leu Asp Gly Ala Leu Gly Gly Thr Val Thr Leu Gln Leu Gly Tyr Pro 
100 105 110 

Pro Ala Arg Pro Val Val Ser Cys Gln Ala Ala Asp Tyr Glu Asn Phe 
115 120 125 

Ser Cys Thr Trp Ser Pro Ser Gln Ile Ser Gly Leu Pro Thr Arg Tyr 
130 135 140 

Leu Thr Ser Tyr Arg Lys Lys Thr Val Leu Gly Ala Asp Ser Gln Arg 
145 150 155 160 

Arg Ser Pro Ser Thr Gly Pro Trp Pro Cys Pro Gln Asp Pro Leu Gly 
165 170 175 

Ala Ala Arg Cys Val Val His Gly Ala Glu Phe Trp Ser Gln Tyr Arg 
180 185 190 

Ile Asn Val Thr Glu Val Asn Pro Leu Gly Gly Ala Ser Thr Arg Leu 
195 200 205 

Leu Asp Val Ser Leu Gln Ser Ile Leu Arg Pro Asp Pro Pro Gln Gly 
210 215 220 

Leu Arg Val Glu Ser Val Pro Gly Tyr Pro Arg Arg Leu Arg Ala Ser 
225 230 235 240 

Trp Thr Tyr Pro Ala Ser Trp Pro Cys Gln Pro His Phe Leu Leu Lys 
245 250 255 

Phe Arg Leu Gln Tyr Arg Pro Ala Gln His Pro Ala Trp Ser Thr Val 
260 265 270 

Glu Pro Ala Gly Leu Glu Glu Val Ile Thr Asp Ala Val Ala Gly Leu 
275 280 285 

Pro His Ala Val Arg Val Ser Ala Arg Asp Phe Leu Asp Ala Gly Thr 
290 295 300 

Trp Ser Thr Trp Ser Pro Glu Ala Trp Gly Thr Pro Ser Thr Gly Thr 
305 310 315 320 

Ile Pro Lys Glu Ile Pro Ala Trp Gly Gln Leu His Thr Gln Pro Glu 
325 330 335 

Val Glu Pro Gln Val Asp Ser Pro Ala Pro Pro Arg Pro Ser Leu Gln 
340 345 350 

Pro His Pro Arg Leu Leu Asp His Arg Asp Ser Val Glu Gln Val Ala 
355 360 365 

Val Leu Ala Ser Leu Gly Ile Leu Ser Phe Leu Gly Leu Val Ala Gly 
370 375 380 

Ala Leu Ala Leu Gly Leu Trp Leu Arg Leu Arg Arg Gly Gly Lys Asp 
385 390 395 400 

Gly Ser Pro Lys Pro Gly Phe Leu Ala Ser Val Ile Pro Val Asp Arg 
405 410 415 

Arg Pro Gly Ala Pro Asn Leu 
420 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..2 

(D) OTHER INFORMATION: /note= "R at Position 1 is A or G" 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 7..8 

(D) OTHER INFORMATION: /note= "Y at Position 7 is C or T" 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 10..11 

(D) OTHER INFORMATION: /note= "R at Position 10 is A or G" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



RCTCCAYTCR CTCCA 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..2 

(D) OTHER INFORMATION: /note= "R at Position 1 is A or G" 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 7..8 

(D) OTHER INFORMATION: /note= "R at Position 7 is A or G" 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 10..11 

(D) OTHER INFORMATION: /note= "R at Position 10 is A or G" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
RCTCCARTCR CTCCA 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..2 

(D) OTHER INFORMATION: /note= "R at Position 1 is A or G" 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 7..8 

(D) OTHER INFORMATION: /note= "N at Position 7 is N" 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 10..11 

(D) OTHER INFORMATION: /note= "R at Position 10 is A or G" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
RCTCCANGCR CTCCA 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..2 

(D) OTHER INFORMATION: /note= "R at Position 1 is A or G" 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 7..8 

(D) OTHER INFORMATION: /note= "N at Position 7 is N" 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 10..11 

(D) OTHER INFORMATION: /note= "R at Position 10 is A or G" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



RCTCCANGGR CTCCA 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..2 

(D) OTHER INFORMATION: /note= "R at Position 1 is A or G" 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 7..8 

(D) OTHER INFORMATION: /note= "Y at Position 7 is C or T" 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 10..11 

(D) OTHER INFORMATION: /note= "R at Position 10 is A or G" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



RCTCCAYTTR CTCCA 



(2) INFORMATION FOR SEQ ID NO: 11: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
TGGTCCACGG TGGAGCCCAT TGGCT 
(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
CCACACGCGG TACGAGTCAG TGCCAGGGAC 
(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
AGCAAGTTCA GCCTGGTTAA G 
(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
CTTATGAGTA TTTCTTCCAG GGTA 

(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

CCCTTCATTG ACCTCAACTA CATG 



(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
CATGCCAGTG AGCTTCCCGT TCAG 
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
GGGTCCTCCA GGGGTCCAGT ATGG 
(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
GGAGGCCTCC AGAGGGT 




(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
CTCCTGTACT TGGAGTCCAG G 
(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GGAAAGCTGT GGCGTGATGG CCGTGGGGCA 
(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
GGGCGGAGGC CGCTGGCGGG CG 
(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

TTATCAGCTG AAGTTCTCTG GGG 

(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..2 

(D) OTHER INFORMATION: /note= "R at Position 1 is G or A" 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 7..8 

(D) OTHER INFORMATION: /note= "N at Position 7 is N" 
(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 10..11 

(D) OTHER INFORMATION: /note= "R at Position 10 is G or A" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



RCTCCANGCR CTCAA 



(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ATCTTCTAGA TCCCCCTGCC CCCAAGCT 



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
ACTTTCTAGA TTATTGCTCC AAGGGGTCCC TGTG 
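
NOTE (reading aid, not part of the filed listing): the short Python sketch below shows one way a reader might cross-check two things in the records above. It takes the reverse complement of the degenerate oligonucleotide of SEQ ID NO: 6 using the IUPAC codes that appear in the feature notes (R, Y, N), and it counts the codons spanned by the CDS locations given for SEQ ID NOs: 2 and 4. The helper names reverse_complement and cds_codon_count are hypothetical and chosen only for illustration; treating LOCATION coordinates as 1-based and inclusive is an assumption that matches the position counts printed beside the sequences.

```python
# Illustrative sketch only; helper names are hypothetical, not part of the listing.

# Complements for the IUPAC nucleotide codes used in SEQ ID NOs: 6-10 and 23.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "R": "Y", "Y": "R", "N": "N"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a (possibly degenerate) DNA sequence.

    Spaces, as used to group the listing into blocks of ten, are ignored.
    """
    return "".join(COMPLEMENT[base] for base in reversed(seq.replace(" ", "")))


def cds_codon_count(start: int, end: int) -> int:
    """Number of codons in a CDS given 1-based, inclusive start and end positions."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


if __name__ == "__main__":
    print(reverse_complement("RCTCCAYTCR CTCCA"))  # SEQ ID NO: 6
    print(cds_codon_count(45, 1340))               # CDS of SEQ ID NO: 2
    print(cds_codon_count(128, 1396))              # CDS of SEQ ID NO: 4
```

Read in codons, the reverse complement of SEQ ID NO: 6 is TGG AGY GAR TGG AGY, which encodes Trp Ser Glu Trp Ser, one instance of the Trp Ser Xaa Trp Ser pattern of SEQ ID NO: 1. The two codon counts, 432 and 423, agree with the lengths stated for SEQ ID NOs: 3 and 5.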



